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SEQUENCE LISTING 

(1) GENERAL INFORMATION 

(i) APPLICANT: Bandman, Olga 

Guegler, Karl J. 
Corley, Neil C. 
Shah, Purvi 

(ii) TITLE OF THE INVENTION: NEW HUMAN CATHEPSIN 



(iii) NUMBER OF SEQUENCES: 4 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Incyte Pharmaceuticals, Inc. 

(B) STREET: 3174 Porter Drive 

(C) CITY: Palo Alto 

(D) STATE: CA 

(E) COUNTRY: USA 

(F) ZIP: 94304 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ for Windows Version 2.0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: To Be Assigned 

(B) FILING DATE: Herewith 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Billings, Lucy J. 

(B) REGISTRATION NUMBER: 36,749 

<C) REFERENCE /DOCKET NUMBER: PF-0331 US 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 415-855-0555 

(B) TELEFAX: 415-845-4166 

(C) TELEX: 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 334 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: THYMNOT02 

(B) CLONE: 347021 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

Met Asn Leu Ser Leu Val Leu Ala Ala Phe Cys Leu Gly lie Ala Ser 

15 10 15 

Ala Val Pro Lys Phe Asp Gin Asn Leu Asp Thr Lys Trp Tyr Gin Trp 

20 25 30 

Lys Ala Thr His Arg Arg Leu Tyr Gly Ala Asn Glu Glu Gly Trp Arg 

35 40 45 

Arg Ala Val Trp Glu Lys Asn Met Lys Met lie Glu Leu His Asn Gly 

50 55 60 

Glu Tyr Ser Gin Gly Lys Leu Gly Phe Thr Met Ala Met Asn Ala Phe 
65 70 75 80 

Gly Asp Met Thr Asn Glu Glu Phe ^Arg Gin Met Met Gly Cys Phe Arg 

85 90 95 

Asn Gin Lys Phe Arg Lys Gly Lys Val Phe Arg Glu Pro Leu Phe Leu 

100 105. 110 

Asp Leu Pro Lys Ser Val Asp Trp Arg Lys Lys Gly Tyr Val Thr Pro 

115 120 125 

Val Lys Asn Gin Lys Gin Cys Gly Ser Cys Trp Ala Phe Ser Ala Thr 

130 135 140 

Gly Ala Leu Glu Gly Gin Met Phe Arg Lys Thr Gly Lys Leu Val Ser 
145 150 155 " 160 

Leu Ser Glu Gin Asn Leu Val Asp Cys Ser Arg Pro Gin Gly Asn Gin 

165 170 175 

Gly Cys Asn Gly Gly Phe Met Ala Arg Ala Phe Gin Tyr Val Lys Glu 

180 185 190 

Asn Gly Gly Leu Asp Ser Glu Glu Ser Tyr Pro Tyr Val Ala Val Asp 

195 200 205 

Glu lie Cys Lys Tyr Arg Pro Glu Asn Ser Val Ala Asn Asp Thr Gly 

210 215 220 

Phe Thr Met Val Ala Pro Gly Lys Glu Lys Ala Leu Met Lys Ala Val 
225 230 235 " 240 

Ala Thr Val Gly Pro lie Ser Val Ala Met Asp Ala Gly His Ser Ser 

245 250 255 

Phe Gin Phe Tyr Lys Ser Gly lie Tyr Phe Glu Pro Asp Cys Ser Ser 

260 265 270 

Lys Asn Leu Asp His Gly Val Leu Val Val Gly Tyr Gly Phe Glu Gly 

275 280 285 

Ala Asn Ser Asn Asn Ser Lys Tyr Trp Leu Val Lys Asn Ser Trp Gly 

290 295 300 

Pro Glu Trp Gly Ser Asn Gly Tyr Val Lys lie Ala Lys Asp Lys Asn 
305 310 315 320 

Asn His Cys Gly He Ala Thr Ala Ala Ser Tyr Pro Asn Val 
325 330 

(2) INFORMATION FOR SEQ ID NO : 2 : 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 66 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: THYMNOT02 

(B) CLONE: 347021 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

CTCAGAGGCT TGTTTGCTGA GGGTGCCTGC GCAGCTGCGA CGGCTGCTGG TTTTGAAACA 60 

TGAATCTTTC GCTCGTCCTG GCTGCCTTTT GCTTGGGAAT AGCCTCCGCT GTTCCAAAAT 120 

TTGACCAAAA TTTGGATACA AAGTGGTACC AGTGGAAGGC AACACACAGA AGATTATATG 180 

GCGCGAATGA AGAAGGATGG AGGAGAGCAG TGTGGGAAAA GAATATGAAA ATGATTGAAC 240 

TGCACAATGG GGAATACAGC CAAGGGAAAC TTGGCTTCAC AATGGCCATG AATGCTTTTG 300 

GTGACATGAC CAATGAAGAA TTCAGGCAGA TGATGGGTTG CTTTCGAAAC CAGAAATTCA 3 60 

GGAAGGGGAA AGTGTTCCGT GAGCCTCTGT TTCTTGATCT TCCCAAATCT GTGGATTGGA 420 

GAAAGAAAGG CTACGTGACG CCAGTGAAGA ATCAGAAACA GTGTGGTTCT TGTTGGGCTT 480 

TTAGTGCGAC TGGTGCTCTT GAAGGACAGA TGTTCCGGAA AACTGGGAAA CTTGTCTCAC 540 

TGAGCGAGCA GAATCTGGTG GACTGTTCGC GTCCTCAAGG CAATCAGGGC TGCAATGGTG 600 

GCTTCATGGC TAGGGCCTTC CAGTATGTCA AGGAGAACGG AGGCCTGGAC TCTGAGGAAT 660 

CCTATCCATA TGTAGCAGTG GATGAAATCT GTAAGTACAG AC CTGAGAAT TCTGTTGCTA 720 

ATGACACTGG CTTCACAATG GTCGCACCTG GAAAGGAGAA GGCCCTGATG AAAGCAGTCG 780 

CAACTGTGGG GCCCATCTCC GTTGCTATGG ATGCAGGCCA TTCGTCCTTC CAGTTCTACA 840 

AATCAGGCAT TTATTTTGAA CCAGACTGCA GCAGCAAAAA CCTGGATCAT GGTGTTCTGG 9 00 

TGGTTGGCTA CGGCTTTGAA GGAGCAAATT CGAATAACAG CAAGTATTGG CTCGTCAAAA 960 

ACAGCTGGGG TCCAGAATGG GGCTCGAATG GCTATGTAAA AATAGCCAAA GACAAGAACA 1020 

ACCACTGTGG AATCGCCACA GCAGCCAGCT ACCCCAATGT GTGAGCTGAT GGATGGTGAG 1080 

GAGGAAGGAC TTAAGGACAG CATGTCTGGG GAAATTTTAT CTTGAAACTG ACCAAACGCT 1140 

TATTGTGTAA G AT AAAC C AG TTGAATCATG GAGGATCCAA GTTGAGATTT TAATTCTGTG 12 00 

ACATTTTTAC AAGGGTAAAA TGTTAC C ACT ACTTTAATTA TTGTTATACA CAGCTTTATG 1260 

ATATCAAAGA CTCATTGCTT AATTCTAAGA CTTTTGAATT TTCATTTTTT AAAAAGATGT 1320 

ACAAAACAGT TTGAAATAAA TTTTAATTCG TATATAAAAA AAAAAA 13 66 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 333 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: 29715 

(B) CLONE: GenBank 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

Met Asn Pro Thr Leu lie Leu Ala Ala Phe Cys Leu Gly lie Ala Ser 

15 10 15 

Ala Thr Leu Thr Phe Asp His Ser Leu Glu Ala Gin Trp Thr Lys Trp 
20 25 30 
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Lys Ala Met His Asn Arg Leu Tyr Gly Met Asn Glu Glu Gly Trp Arg 

35 40 45 

Arg Ala Val Trp Glu Lys Asn Met Lys Met lie Glu Leu His Asn Gin 

50 55 60 

Glu Tyr Arg Glu Gly Lys His Ser Phe Thr Met Ala Met Asn Ala Phe 
65 70 75 80 

Gly Asp Met Thr Ser Glu Glu Phe Arg Gin Val Met Asn Gly Phe Gin 

85 90 95 

Asn Arg Lys Pro Arg Lys Gly Lys Val Phe Gin Glu Pro Leu Phe Tyr 

100 105 110 

Glu Ala Pro Arg Ser Val Asp Trp Arg Glu Lys Gly Tyr Val Thr Pro 

115 120 125 

Val Lys Asn Gin Gly Gin Cys Gly Ser Cys Trp Ala Phe Ser Ala Thr 

130 135 140 

Gly Ala Leu Glu Gly Gin Met Phe Arg Lys Thr Gly Arg Leu lie Ser 
145 150 155 160 

Leu Ser Glu Gin Asn Leu Val Asp Cys Ser Gly Pro Gin Gly Asn Glu 

165 170 175 

Gly Cys Asn Gly Gly Leu Met Asp Tyr Ala Phe Gin Tyr Val Gin Asp 
180 185 190 
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Asn Gly Gly Leu Asp Ser Glu Glu Ser Tyr Pro Tyr Glu Ala Thr Glu 

195 200 205 

Glu Ser Cys Lys Tyr Asn Pro Lys Tyr Ser Val Ala Asn Asp Thr Gly 

210 215 220 

Phe Val Asp lie Pro Lys Gin Glu Lys Ala Leu Met Lys Ala Val Ala 
225 230 235 240 

Thr Val Gly Pro lie Ser Val Ala lie Asp Ala Gly His Glu Ser Phe 

245 250 255 

Leu Phe Tyr Lys Glu Gly lie Tyr Phe Glu Pro Asp Cys Ser Ser Glu 

260 265 270 

Asp Met Asp His Gly Val Leu Val Val Gly Tyr Gly Phe Glu Ser Thr 

275 280 285 

Glu Ser Asp Asn Asn Lys Tyr Trp Leu Val Lys Asn Ser Trp Gly Glu 

290 295 300 

Glu Trp Gly Met Gly Gly Tyr Val Lys Met Ala Lys Asp Arg Arg Asn 
305 310 315 320 

His Cys Gly lie Ala Ser Ala Ala Ser Tyr Pro Thr Val 
325 330 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 334 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: 1468964 

( B ) CLONE : GenBank 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Met Lys Pro Ser Leu Phe Leu Thr Ala Leu Cys Leu Gly lie Ala Ser 

15 10 15 

Ala Ala Pro Lys Leu Asp Gin Asn Leu Asp Ala Asp Trp Tyr Lys Trp 

20 25 30 

Lys Ala Thr His Gly Arg Leu Tyr Gly Met Asn Glu Glu Gly Trp Arg 

35 40 45 

Arg Ala Val Trp Glu Lys Asn Met Lys Met lie Glu Leu His Asn Gin 

50 55 60 

Glu Tyr Ser Gin Gly Lys His Gly Phe Ser Met Ala Met Asn Ala Phe 
65 70 75 80 

Gly Asp Met Thr Asn Glu Glu Phe Arg Gin Val Met Asn Gly Phe Gin 

85 90 95 

Asn Gin Lys His Lys Lys Gly Lys Val Phe His Glu Ser Leu Val Leu 

100 105 110 

Glu Val Pro Lys Ser Val Asp Trp Arg Glu Lys Gly Tyr Val Thr Ala 

115 120 125 

Val Lys Asn Gin Gly Gin Cys Gly Ser Cys Trp Ala Phe Ser Ala Thr 

130 135 140 

Gly Ala Leu Glu Gly Gin Met Phe Arg Lys Thr Gly Lys Leu Val Ser 
145 150 155 160 

Leu Ser Glu Gin Asn Leu Val Asp Cys Ser Arg Pro Gin Gly Asn Gin 
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Gly Cys Asn Gly 
180 

Asn Gly Gly Leu 
195 

Thr Asn Ser Cys 
210 

Gly Phe Val Asp 
225 

Ala Thr Val Gly 

Phe Gin Phe Tyr 
260 

Lys Asp Leu Asp 
275 

Thr Asp Ser Asn 
290 

Pro Glu Trp Gly 
305 

Asn His Cys Gly 



165 

Gly Leu Met Asp 

Asp Thr Glu Glu 
200 

Thr Tyr Lys Pro 
215 

lie Pro Gin Arg 
230 

Pro He Ser Val 
245 

Lys Ser Gly He 

His Gly Val Leu 
280 

Ser Ser Lys Phe 
295 

Trp Asn Gly Tyr 
310 

He Ser Thr Ala 
325 



170 

Asn Ala Phe Gin 
185 

Ser Tyr Pro Tyr 

Glu Cys Ser Ala 
220 

Glu Lys Ala Leu 

235 

Ala He Asp Ala 
250 

Tyr Tyr Asp Pro 
265 

Val Val Gly Tyr 

Trp He Val Lys 
300 

Val Lys Met Ala 
315 

Ala Ser Tyr Pro 
330 







175 




Tyr 


Val 


Lys 


Asp 




190 






Leu 


Gly 


Arg 


Glu 


205 








Ala 


Asn 


Asp 


Thr 


Met 


Lys 


Ala 


Val 








240 


Gly 


His 


Ser 


Ser 






255 




Asp 


Cys 


Ser 


Ser 




270 






Gly 


Phe 


Glu 


Gly 


285 








Asn 


Ser 


Trp 


Gly 


Lys 


Asp 


Gin 


Asn 








320 


Thr 


Val 
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